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(54) Analysis of protein etc. using mass spectrometry 



(57) Methods are provided for determining the presence or absence of ragged ends in polymeric proteins or 
protein products, by means of fast atom bombardment and high mass mass spectrometry (preferably high 
field magnet mass spectrometry). The methods may be modified for locating N- or C- terminal peptide in a 
polymeric protein or protein product and for assigning disulphide bridges in polymeric protein or protein 
products. The methods may be applied to other biopolymers such as nucleotides and carbohydrates. 
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SPECIFICATION 

Analysis of polymeric protein and protein 
products 

5 

This invention relates to a method of analys- 
ing a polymeric protein or protein product for 
the presence or absence of a ragged end at 
its C-terminus and/or N-terminus. The inven- 
10 tion is also capable of locating and identifying 
microheterogeneity in recombinant protein pro- 
ducts, and capable of assigning disulphtde 
bridges. 

When producing a protein product by 

15 means, for example, of recombinant DNA bio- 
technology, it is possible for the protein pro- 
duct produced to be shorter or longer than 
expected due to one or more amino acids in 
the amino acid sequence being missing from 

20 or added to either the N-terminus, or, more 
usually, the C-terminus of the protein product. 
Such a deficiency is referred to as a ragged 
end. The absence or addition of such amino 
acids can effect the usefulness or acceptability 

25 of the protein product so it is important to 
know whether such a ragged end exists and 
to know the constitution of the end portion(s) 
of the protein product. 
The traditional method for ascertaining the 

30 structure of a polymeric protein is to degrade 
the polymer into component parts, to isolate 
and purify each part and then to perform 
structural studies upon each part by stepwise 
chemical degradation (Edman degradation). 

35 This work is extremely time-consuming be- 
cause of the purification steps required and 
moreover suffers from the following difficul- 
ties; 

1) inability to cope with blocked N-termina! 
40 proteins or peptides 

2) the inability to identify new post-transla- 
tional modifications, insertions, deletions, or 
chemical changes such as phosphorylation or 
glycosylation 

45 3) difficulty in assigning C-terminal se- 
quences i.e. 

at the end of the molecule opposite to 
where degradation is being effected. 
In fact if a mixture of proteins, is being 

50 analysed, the method cannot be used to reli- 
ably derive complete information about the N- 
or C-terminus. 

it has been suggested to perform mass- 
spectrometric analysis of proteins using fast 

55 atom bombardment analysis in a high mass or 
a high field magnet mass spectrometer but 
there has been no suggestion that these tech- 
niques can be used to look for ragged ends at 
the N-or C-terminus in the protein or protein 

60 product, or to detect microheterogeneity in 
proteins or to assign disulphide bridges. 

According to a first aspect of the present 
inventi n there is pr vided a m th d of ana- 
lysing a polymeric protein or protein pr duct 

65 for the presence or absence f a ragg d end 



(as hereinbef re defined) at its C- or N-termi- 
nus which method comprises the steps of: 

a) carrying out a chemical or enzymatic di- 
gestion upon a said polymeric protein or pro- 

70 tein product to produce N- or C-terminus frag- 
ments which are differentiable from non-termi- 
nal fragments of the digested protein or pro- 
tein product; 

b) subjecting the N- and/or C-terminus frag- 
75 ments obtained to fast atom bombardment 

and high mass mass spectrometry; 

c) analysing the mass spectrometric data 
obtained and checking for the occurrence of 
fragments having a mass which would be un- 

80 expected from a study of a known complete 
molecule of the protein or protein product 
which is believed to be present, the occur- 
rence of such unanticipated fragments indicat- 
ing the presence of a said ragged end. 
85 A similar methodology can be used to de- 
tect unanticipated protein fragments which in- 
dicate the presence of microheterogeneity or 
of blocking or of post-translational groups. 
The term "differentiable" employed in the 
90 foregoing definition of the invention is in- 
tended to mean that the respective terminal 
and non-terminal fragments are clearly distin- 
guishable from each other by mass spectro- 
metric techniques. The terminal fragments may 
95 be distinguished from non-terminal fragments 
either on the basis that terminal fragments are 
likely to be considerably smaller than non-ter- 
minal fragments, e.g. 1/2 to 1/3 of their 
mass, or on the basis of expected mass 

100 where the expected composition of the pro- 
tein or protein product is known and the re- 
sults of digestion can therefore be predicted. 

The above invention is most useful for 
checking the constitution of a protein or pro- 

105 tein product where the amino acid sequence 
of the whole protein or protein product which 
is believed to be present is known. The inven- 
tion is also utilizable in those cases where the 
complete amino-acid sequence of the N- an- 

110 d/or the C-terminus is unknown. 

The foregoing invention can be extended so 
that not only can the mass of the terminus 
fragment be determined but also the amino 
acid sequence of that fragment. In general any 

115 specific mass is almost unique to a particular 
group of specific amino-acids. The specific se- 
quence of the amino acids within the fragment 
can be determined by further digestion of the 
protein using a different digestion agent caus- 

120 ing cleavage at different sites and hence the 
formation of subfragments. 

Sometimes the step of fast atom bombard- 
ment and high field magnet mass spectrome- 
try will cause the terminal fragments to break 

125 up and produce ionic subfragments. The 

masses of the ionic subfragments can be used 
to assess the mak up and sequence of the 
amino acid residues in the terminal fragment. 
In such a case there would be no need for the 

130 furth r dig sti n st p r f rred to abov . 
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Sometimes the mass of a fragment may 
convey ambiguous information about the am- 
ino acid c ntent of the peptide fragment; in 
such a case a further digestion to produc 
5 smaller peptide fragments or a digestion of 
further starting material using a different diges- 
tion technique (such as CNBr, Edman or endo- 
or exopeptidases) to produce cleavage at dif- 
ferent sites, can resolve the ambiguity. Infor- 
10 mation can also be obtained by analysis of a 
chemical derivative or by measurement of the 
accurate mass of the peptide fragments, e.g. 
by standard methods, to resolve the ambigu- 
ity. 

1 5 According to a second aspect of the pre- 
sent invention there is provided a method for 
locating the N- or C-terminal peptide in a poly- 
meric protein or protein product which method 
comprises the steps of: 

20 a) carrying out a chemical or enzymatic di- 
gestion upon a said product to produce N- or 
C-terminus fragments which are differentiate 
(as hereinbefore defined) from non-terminal 
fragments, of the digested protein or protein 

25 product; 

b) subjecting the protein fragments obtained 
to fast atom bombardment and high mass 
mass spectrometry; 

c) carrying out a second and different diges- 
30 tion upon the products of step a); 

d) subjecting the protein fragments of step 
c) to fast atom bombardment and high mass 
mass spectrometry; and 

e) analysing and comparing the mass spec- 
35 trometric data obtained from both mass spec- 
trometry exercises to establish whether or not 
some or all of the mass spectrometry signal 
peaks have shifted and deducing from this the 
location of the terminal peptide. 

40 In addition to location of the terminal pep- 
tide it is possible to identify all of the amino 
acid residues in the terminal fragment and to 
work out the sequence of those amino acid 
residues in the terminal fragment. This is 

45 achieved by conducting an additional digestion 
to remove a final amino acid residue from the 
terminal fragment using e.g. an Edman degra- 
dation or an exopeptidase, such as carboxy- 
peptidase Y, together with fast atom bom- 

50 bardment and high mass mass spectrometry, 
before the digestion step a) of the second 
aspect of the present invention. The effect of 
this extra digestion is to remove the final am- 
ino acid, residue and allow it to be undividually 

55 identified from the mass spectrometry results. 
This step can be repeated several times to 
identify the sequence of the amino acid resi- 
dues in the terminal fragment. It can be per- 
formed to identify the sequence at either the 

60 N-terminus or at the C-terminus. Any suitable 
chemical or enzymatic digestion may be em- 
ployed in st p a) of th pr sent invention, 
such digestion being directed at one or more 
specific sites. Examples of suitable digesti ns 

65 are by the use of trypsin, chymotrypsin, cyan- 



ogen bromide, carboxy peptidase A, carboxy- 
peptidase S r carboxypeptidase Y and by the 
use of Edman degradation using phenyl iso- 
thiocyanate followed by cleavage with trifiuo- 

70 racetic acid. To bring the protein or protein 
product molecule into the effective routine 
mass range of the high mass mass spectro- 
meter, a reduction step can be carried out, 
e.g. by using dithiothreitol followed, if neces- 

75 sary, by alkylation, acidification, desalting and 
elution. 

According to a third aspect of the present 
invention there is provided a method of as- 
signing disulphide bridges in polymeric protein 
80 or protein products which method comprises 
the steps of: 

digesting a said protein or protein product in 
an unreduced state; 
subjecting the digest mixture or one or 
85 more fractionated components thereof to fast 
atom bombardment and high mass mass 
spectrometry; and 

identify disulphide bridged peptides by their 
characteristic masses. The interpretation may 
90 be confirmed by steps of subdigestion and/or 
reduction and/or Edman degradation and fur- 
ther fast atom bombardment and high mass 
mass spectrometry. The method can also be 
used to study the dynamics of protein denatu- 
95 ration/renaturation and the dynamics of pro- 
tein folding. It is to be understood that pep- 
tides resulting from a reduced disulphide 
bridge will have characteristic mass whilst the 
unreduced disulphide bridge will have a weight 
100 of the added masses less 2, this discrepancy 
resulting from the presence or absence of two 
hydrogen atoms. 

A preferred technique for conducting enzy- 
matic digestion is to subject from 10 pico- 
105 moles to 50 nanomoles of the protein to the 
enzyme, such as trypsin, in a 50 millimolar 
ammonium bicarbonate buffer at pH 8.5 with 
an enzyme: substrate ratio of 1:50 for from 1 
to 3 hours at 37°C or to an enzyme such as 
110 pepsi-n in 5% formic acid for 6 hrs. 

A preferred technique for conducting an Ed- 
man degradation is to subject the protein or 
protein product to treatment with phenyl iso- 
thiocyanate at pH 8.5 at 65°C for 1 hour fol- 
115 lowed by cleavage with trifluoroacetic acid at 
65°C for 10 minutes. This degradation can be 
repeated as many times as necessary until the 
peptides have been shortened to the required 
length, this length being monitored and as- 
120 signed by means of fast atom bombardment 
and high mass mass spectrometry. 

Preferably the high mass mass spectrometry 
is high field magnet mass spectrometry al- 
though any other suitable mass spectrometry 
125 technique or instrument may be employed, 
such as a time of flight instrument using Cali- 
fornium 252 fission bombardment. 

The fast atom bombardment and high mass 
mass spectrometry is preferably carried out 
130 using apparatus such as a VG ZAB HF or SE 



instalment preferably equipped with an M- 
SCAN fast atom bombardment gun, using Xe- 
non or cesium as the primary ion using beam 
(current of 20^ A at 8KeV). Samples of a size 
5 10 picomoies to 5 nanomoles in 2/i1 of 5% 
acetic acid are preferably loaded onto a gly- 
cerol coated probe tip preferably followed by 
1//1I of thioglycerol. The fast atom bombard- 
ment and high field magnet mass spectrome- 

10 try may be carried out directly upon the pro- 
tein digest or after separation by any suitable 
technique such as by using gel permeation 
chromatography or by using high pressure 
liquid chromatography. 

15 The accompanying diagram, which illustrates 
schematically complete and incomplete protein 
products, indicates how selective cleavage of 
the protein products can be used to determine 
the presence of ragged ends. 

20 The topmost protein molecule in the dia- 
gram may be regarded as complete; therefore 
selective cleavage near the C-terminus will 
produce a peptide fragment (I) of known pre- 
dictable mass. Such selective cleavage may be 

25 effected by any suitable specific cleaving 
agent e.g. by enzymes or chemically; the 
method employed in the example was to use 
cyanogen bromide which cleaves the peptide 
bond whose carbonyl function is contributed 

30 by a methionine (MET) residue. 

If the protein or protein product is not com- 
plete at its C-terminus (i.e. if it has undergone 
proteolytic nicking) as in the middle and lower 
molecules illustrated schematically in the dia- 

35 gram, then cleavage will produce other pep- 
tide fragments (II) and (III) whose masses 
would not be anticipated if only complete pro- 
tein or protein product was present. The pres- 
ence of the additional fragments II and 111 will 

40 thus reveal the presence of ragged ends and 
allow the degree of proteolytic nicking to be 
ascertained. These fragments may be quanti- 
fied either by carboxypeptidase B analysis of 
the intact proteins or by prior separation of 

45 the above assigned fragments combined with 
such methods as uV analysts and fast atom 
bombardment mass spectrometry or amino- 
acid analysis. 
As an example of the location and identifi- 

50 cation of a terminal fragment, a protein was 
digested under conditions as set out above 
with trypsin to cleave those peptide bonds in 
which the carboxyl derives from lysine (LYS) 
or arginine (ARG). 

55 The resultant digest was subjected without 
separation to fast atom bombardment and 
high field magnet mass spectrometry using the 
preferred apparatus and conditions described 
above. 

60 Subsequently a disgestion was carried out 
upon the tryptic digest using carboxypeptidase 
B (which cleaves terminal LYS or ARG amin 
acid residues which are not adjacent to proline 
(PRO)) under the above-menti ned preferr d 

65 conditions and further fast atom bombardment 
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and high field magnet mass spectrometry was 
effected, again using the preferred apparatus 
and conditions. 
A comparison f the spectrometry data of 

70 the respective digests shows that all masses 
shift by either 128(LYS) or 156(ARG) mass 
units except the C-terminal peptide or peptides 
terminating in PRO-ARG or PRO-LYS thus al- 
lowing location and characterization of the C- 

75 terminal peptide. If all the signals shift this 
indicates that the whole protein terminates in 
LYS or ARG or that a peptide is missing. 

The method of the present invention is 
more precise, much quicker (and hence less 

80 expensive) than existing traditional techniques, 
and can be conducted quantitatively. It allows 
gene products to be screened rapidly for er- 
rors of translation, mutations, deletions and 
insertions and may be used for routine quality 

85 control. 

Although the invention has been described 
with reference to proteins and protein pro- 
ducts, it is envisaged that it may have applica- 
bility to other biopolymers such as nucleotides 

90 and carbohydrates. 

CLAIMS 

1 . A method of analysing a polymeric pro- 
tein or protein product for the presence or 
95 absence of a ragged end (as hereinbefore de- 
fined) at its C- or - N-terminus which method 
comprises the steps of: 

(a) carrying out a chemical or enzymatic di- 
gestion upon a said Polymeric protein or pro- 

100 tein product to Produce N- or C-terminus frag- 
ments which are differentiable from non-termi- 
nal fragments of the digested protein or pro- 
tein product; 

(b) subjecting the N- and/or C-terminus frag- 
105 ments obtained to fast atom bombardment 

and high mass mass spectrometry; 

(c) analysing the mass spectrometric data 
obtained and checking for the occurrence of 
fragments having a mass which would be un- 

1 10 expected from a study of a known complete 
molecule of the protein or protein product 
which is believed to be present, the occur- 
rence of such unanticipated fragments indicat- 
ing the presence of a said ragged end, 

115 2. A method according to claim 1, wherein 
a chemical digestion is carried out using 
chemicals selected from the group consisting 
of (a) phenyl isothiocyanate followed by cleav- 
age with triflouracetic acid, and (b) CNBr. 

120 3. A method according to claim 1, wherein 
an enzymatic digestion is carried out using an 
enzyme selected from the group consisting of 
trypsin, chymotrypsin, carboxypeptidase A, 
carboxypeptidase B, and carboxypeptidase Y, 

125 4. A method according to claim 1 2 or 3 
wherein a preliminary reduction is effected 
upon the protein r prot in product, to bring 
the protein or protein product molecules into 
the effective routine mass range of the high 

130 mass mass spectromet r. 
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5. A meth d according to any prec ding 
claim, wherein steps (a) to (c) are repeated 
upon a said prot in or pr tein product using a 
second and different digestion thereby to 

5 cause cleavage at different sites and hence 
the formation of subfragments. 

6. A method according to any preceding 
claim, wherein fast atom bombardment is ef- 
fected using as the primary ionising beam 

10 atoms selected from the group consisting of 
Xenon atoms and cesium atoms. 

7. A method according to any preceding 
claim, wherein the high mass mass spectro- 
metry comprises high field magnet mass spec- 

15 trometry. 

8. A method according to any of claims 1 
to 6, wherein the high mass mass spectrome- 
try comprises Californium 252 fission bom- 
bardment. 

20 9. A method for locating the N- or C-termi- 
nal peptide in a polymeric protein or protein 
product which method comprises the steps of: 

(a) carrying out a chemical or enzymatic di- 
25 gestion upon a said product to produce N- or 
C-tetminus fragments which are differentiate 
(as hereinbefore defined) from non-terminal 
fragments, of the digested protein or protein 
product; 

30 (b) subjecting the protein fragments obtained 
to fast atom bombardment and high mass 
mass spectrometry; 

(d) subjecting the protein fragments of step 
(c) to fast atom bombardment and high mass 

35 mass spectrometry; and 

(e) analysing and comparing the mass spec- 
trometry data obtained from both mass spec- 
trometry exercises to establish whether or not 
some or all of the mass spectrometry signal 

40 peaks have shifted and deducing from this the 
location of the terminal peptide. 

10. A method of assigning disulphide 
bridges in polymeric protein or protein pro- 
ducts which method comprises the steps of: 

45 digesting a said protein or protein product in 
an unreduced state; 

subjecting the digest mixture or one or 
more fractionated components thereof to fast 
atom bombardment and high mass mass 
50 spectrometry; and 

identify disulphide bridged peptides by their 
characteristic masses. 

1 1. A method according to claim 1, 9 or 
10 which is carried out upon a nucleotide or 

55 carbohydrate instead of upon a polymeric pro- 
tein or protein product. 

12. A method of analysing a polymeric pro- 
tein or protein product, substantially as herein- 
before described. 

60 13. A method for locating the N- or C-ter- 
minal peptide in a polymeric protein or protein 
product, substantially as hereinbefore de- 
scribed. 

14. A method of assigning disulphide 
65 bridges in a polymeric protein or protein pro- 



duct, substantially as hereinbefor described. 

15. The features hereinbefor disclosed, or 
their quiyalents, in any novel selection. 
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